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Abstract 

The state-dependent multiple access channel (MAC) is considered where the state sequences are 
known causally to the encoders. First, a MAC with two independent states each known causally to one 
encoder is revisited, and a new achievable scheme inspired by the recently proposed noisy network 
coding is presented. This scheme is shown to achieve a rate region that is potentially larger than that 
provided by recent work for the same model. Next, capacity results are presented for a class of channels 
that include modulo-additive state-dependent MACs. It is shown that the proposed scheme can be easily 
extended to an arbitrary number of users. Next, a similar scheme is proposed for a MAC with common 
state known causally to all encoders. The corresponding achievable rate region is shown to reduce to the 
one given in the previous work as a special case for two users. Finally, output feedback is introduced 
in the model at hand with independent states and an example is provided to show the advantage of 
feedback in enlarging the capacity region. 



The work of M. Li and A. Yener was supported in part by the National Science Foundation under Grants CNS 0716325, 
CCF 0964362 and DARPA ITMANET Program under Grant W91 lNF-07- 1-0028. The work of O. Simeone was supported in 
part by the National Science Foundation under Grant CCF 0914899. 



2 



I. Introduction 

State-dependent channels model relevant phenomena in wireless communication links, such as 
fading and interference. A usual model is that there exists a state sequence s" = (si, S2, Sn) 
with Si, i = l,2,...,n, denoting the state value affecting the channel at time instant i. Un- 
derstanding the merit of state information, i.e., information about the sequence s", for reliable 
communication is a key problem of both theoretical and practical interest. State-dependent chan- 
nels are mainly classified into three groups with respect to the availability of state information at 
the encoders: (i) non-causal state information where the encoders know the entire state sequence 
s" before encoding for the current block; (ii) causal state information where at channel use 
i, the encoders know all states up to and including at i; (Hi) strictly casual state information 
where at channel use i the encoders know a delayed version of state. Here we shall consider 
setting (Hi) which is most realistic. We focus on models where the decoders do not have channel 
state information. As an example, in a cognitive radio setup [Hl-lEIl, when the state models the 
interference from other users, cognitive transmitters, capable of sensing their environment, can 
be cognizant of the interference in a causal fashion, while the corresponding receivers are not 
able to measure the interference. 

In a point-to-point discrete memoryless channel (DMC), Shannon BU has show that strictly 
causal feedback of the received symbols from receiver to transmitter, i.e., output feedback, does 
not increase the capacity. It can be shown that strictly causal state information at the encoder 
has the same effect for a memoryless state-dependent channel with independently and identically 
distributed (i.i.d.) state sequence s". On the contrary, causal and non-causal state information 
turns out to be useful for point-to-point memoryless channels and capacity-achieving strategies 
have been derived for the former Q and the latter iH, |I3. 

State-dependent multi-user channels have also been investigated, see |[8l and references therein. 
In particular, we summarize existing results on state-dependent multiple access channels (MAC), 
which are the focus of our work. Reference flUl derived single-letter inner and outer bounds on the 
capacity region for two-user MACs with causal state information at the transmitters. Reference 
ifTOl derived a genie-aided bound to assess the capacity advantage of non-causal state over causal 
state for both the point-to-point channels and MACs with independent state sequences available at 
the two transmitters. Reference [fTT| explicitly characterized the capacity region of a cooperative 
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MAC with states non-causally available at one encoder, while reference lfT2]| proposed several 
inner and outer bounds for a MAC with states non-causally known to some encoders. A lattice 
coding strategy was proposed for a MAC with non-causal state information as in |fT3l and [fT4ll . 

While the work summarized above focused on non-causal or causal state information, recently, 
references [|T5l and [[T6l have demonstrated that, even with strictly casual state information at the 
encoders of a state-dependent MAC channel, an improvement in the capacity region is possible. 
In ifTSl . a common state sequence is known either strictly causally or causally at both of encoders 
in a two-user MAC, while in lfT6l two independent state sequences are present with each strictly 
causally or causally known to one encoder. An achievable rate region is derived in both papers 
and the capacity region is identified for some special cases including Gaussian models. The 
main idea in the achievability proofs is to use a block Markov coding scheme in which the two 
users cooperatively ifTSl or non-cooperatively lfT6l transmit compressed state information to the 
decoder, which in turn uses such information to perform coherent decoding. The results show 
that an increase in the capacity region can be obtained by devoting part of the transmission 
resources to the transmission of the compressed state. 

In this paper, we first focus on the MAC with independent states each strictly causally 
known to one transmitter of lfT6l . Combining the same idea of letting transmitters convey a 
compressed version of the state sequence to the destination, we propose a new achievable scheme 
with simple encoding and a decoding strategy similar to the noisy network coding ifTTl . The 
resulting achievable rate region is potentially larger than the original since it admits a larger 
input distribution. The capacity result for Gaussian channels of ffT6l for the case of a single 
state sequence is generalized to a larger class of channels that includes modulo-additive state- 
dependent MACs. The proposed scheme is then extended to arbitrary number of users. A similar 
scheme is also applied to derive an achievable rate region for the MAC of arbitrary number of 
users with common state. As a special case, it is shown to recover the region for the two-user 
MAC with common state given in ifTSl . 

Finally, we introduce output feedback in our model with independent states. We show via a 
specific example that feedback allows user cooperation for the transmission of state information 
to the receiver, beside standard cooperation on the transmission of messages ifTSl . and this 
increases the capacity region. This observation is somewhat related to recent work [[T9l where it 
is shown that common state information allows empirical coordination in a MAC with non-causal 
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State information, along with state information at the decoder and conferencing links between 
encoders. 

The remainder of the paper is organized as follows: In Section [III we describe the general 
model considered in this work and summarize some existing results in |fT6l . Sections |lll] and 
ITVl focus on a two-user state-dependent MAC. Section |V] provides a generalization to arbitrary 
number of users with independent states. Section |VI] studies MACs of arbitrary number of users 
with common state. Section IVIII introduces output feedback and provides an example to show 
the benefits. Section IVIIII concludes the paper. 

Throughout the paper probability distributions are denoted by P subscripted by the random 
variables involved, e.g., Px (x) is the probability of X = x, Py\x {y\x) is the conditional 
probability of Y = y given X = x, etc. When the meaning is clear from the context, we will 
drop subscripts from the probability functions, e.g., P (x) will stand for Px (x). Also x^ denotes 
vector [xki, Xki]. E [X] denotes the expectation of random variable X. Mf denotes the set of 
non-negative real vectors in i dimension. Notation co refers to the convex hull operation of a 
set. A/'(0,(T^) denotes a zero-mean Gaussian distribution with variance cr^. C{x) is defined as 

C(x) = il0g2(l+X). 

II. Channel Model and Preliminaries 

In this section, we describe our channel models, formulate the problem and revisit some related 
results derived in previous work |fT6ll . 

A. MAC with Independent States 

We first investigate an M -user discrete memoryless MAC channel with M mutually indepen- 
dent states, which is denoted by: 

{Xi X ... X Xm.Si X ... X SM,y,P{si) ...P (sm) ,P{y\ Xi, xmi Si, sm )) (1) 

with input alphabets {Xi, ...,Xm), output alphabet y and state alphabets (iSi, ...,Sm)- The state 

M M n 

sequences are assumed to be i.i.d. and are mutually independent, i.e., Yl P i^l) = 11 11 (■^fci)- 

k=l k=l 1=1 

The state-dependent channel is memoryless in the sense that at any discrete time i = 1, ...,n: 

P {yi\xi, X^j, Si, s'^,j,y^ ^) = P {yi\xii, XMi, Sii, SMi) ■ (2) 



5 



Each state realization is available to one encoder in a strictly causal manner as defined in Section 
m Transmitter /c's signal x^' is subject to an average input cost constraint: 

1 " 

- VE[cfc(Xfc,)] <rfc, A; = 1,...,M, (3) 

n ^-^ 

1=1 

where Ck : Xk -> is a single-letter input cost function for transmitter k and the expectation 
is taken with respect to all the messages and states. We now define the following code. 

Definition 1: Let Wk, uniformly distributed over the set Wk = [1 : 2"^*], be the message sent 
by transmitter k. A (2"^\ 2"^", n, Fi, Tm) code with strictly causal and independent state 
information at the encoders consists of sequences of encoder mappings: 

fk,^■. WkxSl'^Xki, ^ = l,...,n, k = l,...,M, (4) 

each of which maps message Wk to a channel input such that (|3]) is satisfied, and a decoder 
mapping 

g: y"^Wi X ... X Wm, (5) 

which produces the estimate of messages (wi, ...,wm)- 
The average probability of error, Pr (E), is defined by: 

Ft{E) = - J2- 5^ Pr(^?(2/")7^K,...,^A/)IK,...,w^M) sent) (6) 

2nRk wi=l WM=1 

k=l 

Given a cost tuple T = (ri,...,rM), a rate tuple {Ri, Rm) is said to be F-achievable 
if there exists a sequence of codes (2"^^, 2"^^^, n, Fi, r^/) defined above such that the 
probability of error Fi{E) — as n — t- oo. The capacity region C (F) is the closure of all the 
F-achievable rate tuples. 

B. MAC with Common State 

We also investigate a general M-user discrete memoryless MAC channel with common state, 
which is denoted by: 

Xi, Xm, s)) (7) 
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with input alphabets {Xi, ...,Xm), output alphabet y and state alphabet S. The state sequence 

n 

is assumed to be i.i.d., i.e., P(s") = H^l-^i)- Moreover, the state-dependent channel is 

1=1 

memoryless in the sense that at any discrete time i = I, ...,n, we have 

P {yi\x'l,...,xlj,s'',y'~'^) = P{yi\xu,---,XMi,Si) . (8) 

Assume the common state is strictly causally known to all encoders. Again, each transmitted 
sequence x]^ is subject to an average input cost constraint: 

1 " 

-Y,^[ck{Xk^)]<T,, k = l,...,M, (9) 

1=1 

where : — ?■ is a single-letter input cost function for transmitter k and the expectation 
is taken with respect to all the messages and states. We have the following code definition. 

Definition 2: Let Wk, uniformly distributed over the set = [1 : 2"^*], be the message 
sent by transmitter k. A (2"^\ 2"^", n, Fi, Fa/) code with strictly causal common state 
information at the encoders consists of sequences of encoder mappings: 

fk,^■. WkxS'-'^Xki, z = l,...,n, k = l,...,M, (10) 

each of which maps message Wk to a channel input such that ^ is satisfied, and a decoder 
mapping 

g: y'^WiX ... X Wm, (11) 

which produces the estimate of messages {wi, ...,wm)- 

The average probability of error Pr {E), the F-achievable rates and the capacity region C (F) 
are defined as was done for the model in III-AI 

We first restrict our attention to a two-user MAC with two independent states, and then 
generalize to an arbitrary Af-user MAC with independent states in Section |Vl We study an 
arbitrary M-user MAC with common state in Section |Vll 

C. Preliminaries 

For comparison, we summarize and slightly extend a key result of lfT6l . 
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Theorem 1: [[T6l Let T = (Fi, be given. For the two-user MAC with strictly causal state 
information, a F-achievable rate region is given by: 

{RuR2) : ((i?i,i?2),(ri,F2)) e 



T^inl (ri, F2 



CO U (^.nl(/3l,/32),(/3l,/32)) 
/3i>0,/32>0 



(12) 



where TZmi /32) is the union of the sets: 

(i?!, R2) G 1^2^ : 



7^..l (/3i,/32) = |J 



i?i < / (Xi; F 1X2, Vu V2)-I {V,; \Y, V2) 
R2<I (X2; Y |Xi, Vu V2)-I {V2; S2 \Y, Vi ) 
y Ri + R2<I{X^,X2;Y\Vi,V2)-I{Vi,V2;Si,S2\Y) J 
with the union taken over the distributions in the set of 



(13) 



Vsc — {Pvi.y2,Si,S2,Xi,X2,Y '■ Pvi\Si PV2\S2 PsiPs2PxiPx2Py\Si,S2,Xi,X2 } 



(14) 



satisfying E [ck (X^)] < A; = 1, 2. 

Remark 1: The F-achievable region given here is a straightforward extension from the original 
one in [[T6ll where time-sharing among different cost constraints is allowed, see for example, [20] . 

The basic idea of the achievable scheme of Theorem [U is to let transmitters convey a com- 
pressed version of the state, namely Vi for and V2 for 5*2, to the receiver. The decoder can 
then use this partial information about the state to improve decoding. As an example, if the state 
models fading channels, state information enables partially coherent decoding. The proof of the 
theorem, though not available in detail in lfT6l . is there indicated to be based on a scheme that 
exploits distributed Wyner-Ziv compression ETI and block Markov encoding in lfT6l . □ 



III. A New Achievable Rate Region 

In this section, for the two-user MAC with two independent states, we propose a new achiev- 
able scheme that is based on simple encoding, whereby no block-Markov operation is necessary, 
and on a decoding strategy that is similar to that of the noisy network coding scheme of [[TtI . 
The scheme achieves a potentially larger achievable rate region and can be easily generalized to 
an arbitrary number of users as discussed in Section |Vl 
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Theorem 2: Let V = (ri,r2) be given. For the two-user MAC with strictly causal state 
information, a F-achievable rate region is given by 



'^-^^'^'^'^^^=< CO U (7^.n2(/3l,/32),(/3l,/32)) ^ ^'^^ 

/3i>0,/32>0 



where TZin2 (^2) is the union of the sets: 

{RuR2) e : 



7^.n2 (/3l,/32) = |J^ 



i?i < / (Xi, F 1X2, V2)-I (Fi; 5i 

i?2 < / (X2, ^2; 1^ ) - J (V2; 52 1X2 ) 

[ i?i + i?2</(Xi,X2,l^i,V^2;F)-/(\/i;5i|Xi)-/(\/2;52|X2) J 



(16) 

with the union taken over the distributions in the set of 

^*sc = {Pvi,V2,SuS2,Xi,X2,Y '■ fVi\Si,Xi PV2\S2,X2 PsiPs2PxiPx2Py\Si,S2,Xi,X2 } (17) 

satisfying E [ck (Xk)] < k = 1, 2. 

Proof: The theorem follows as a special case of the M-user result of Theorem [5] for M = 2. 
We refer the reader to Appendix |A] for the proof of Theorem [5l ■ 

Remark 2: The region (fT6l) is potentially larger than (fT3l) since it allows a more general input 
distribution. In fact, it is admissible to generate Vk dependent of both Sk and Xk, k = 1,2. This 
is proved in Theorem [3l □ 

Remark 3: In the proposed strategy, similar to [fT6ll . at each block, each transmitter sends out a 
codeword which carries both message information and information regarding the state sequence 
in the previous block. However, unlike [[T6ll . the encoders do not use block Markov coding but 
instead encode the entire message over all the blocks, as in noisy network coding scheme of [[TtII . 
Moreover, as in ([TTI. the decoding is implemented jointly over all blocks without requirement 
of decoding all compressed states correctly. □ 

Remark 4: By applying the strategy in ^ as done in Theorem 2 in [fT6ll . our proposed 
technique can be easily generalized to a system where channel state information is known causally 
to the encoders. Details are omitted for the sake of conciseness. □ 

Theorem 3: The new achievable rate region of Theorem [21 includes the original of Theorem 
[B i.e., TZin2 (Fi, r2) ^ TZini (Fi, r2) for all cost constraint pairs (Ti, 12). 



Proof: Given any cost constraint pair (/3i,/32), setting Pvi\Si,Xi = Pvi\Si and Pv2\S2,X2 = 
Pv2\S2 in '}Zin2 /32) we obtain the following, 
1) For the sum-rate bound, 

R, + R2<I (Xi, X2; Y\VuV2) + I (Vu V2; Y)-I {V^■ \X^) - I {V2; S2 {X^ ) (18) 



= / (Xi, X2; Y\V,,V2) + H {V^ \S,) + H{V2\S2)-H {V^, V2 \Y) (19) 
= I{X^,X2■,Y\VuV2) + H{V^\SuS2,Y) + H{V2\S2,SuVuY) 

-H{VuV2\Y) (20) 

= I (Xi, X2; Y\VuV2)-I {Vu V2; 5i, 52 IF) (21) 



where (|20l) follows from the Markov chain {Vi,V2) o (5*1, 5*2) o F and from the fact that 
(Vi,S'i) are independent of (V2,S'2). Note the last equation is exactly the same sum-rate 



bound in TZini 132) given by (fT3l) . 
2) For the individual rate bound on Ri, 

R^<I (Xi; Y 1X2, V^,V2) + I [Vr-Y IX2, V2) - I {Vv^ S, \X, ) (22) 

= / (Xi; Y 1X2, V^i, 1^2 ) + (Vi 1^1 ) - (V^i X2, V2) (23) 

>J (Xi; F 1X2, Vu V2) + H {Vr \S^) - H [V^ |F, V2) (24) 

= I (Xi; F 1X2, 1^1, V2) + H (Vi Y,V2)-H {V^ |F, 1^2) (25) 

= J (Xi; Y 1X2, 1^1, 1^2) - / (V^i; 5i |F, V2) (26) 



where (l24l) follows from conditioning reduces entropy while (1251) follows from the Markov 
chain Vi o 6*1 o Y . The last equation is exactly the same as the bound on Ri in 
7^^„l (/3i,/32) given by ([l3]). 
3) A similar observation holds for R2 by symmetry. 

These three facts imply that TZin2 (/3i,/32) 5 Tlini {13 1, 132) and applying time-sharing among 
different cost constraints, we conclude that '^m2(ri,r2) ^ '^mi(ri,r2) for all every cost 
constraint pair (Fi, r2). ■ 

IV. Capacity Result 

In this section, we generalize the capacity result for Gaussian channels of [[T6ll for the special 
case of a single state sequence to a larger class of channels. We also provide an example that 
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demonstrates the benefits of strictly casual state information. 

Consider a class of discrete memoryless two-user deterministic MACs denoted by Vmac, in 
which the output F is a deterministic function of inputs Xi, X2 and the channel state S: 



Y = f{X,,X2,S) 



(27) 



c(r)^U 



> 



(29) 



and where state S is strictly causally known to encoder 1, while the deterministic function 
/(.,.,.) satisfies the following assumption: 

H{Y\XuX2)=H{S) (28) 

for all product input distributions Px^Px2 satisfying input cost constraints ([3]) for k = 1,2. 
Then the capacity region for Vmac is identified as follows. 

Theorem 4: Let F = (11,12) be given. For any MAC in Vmac defined above, the capacity 
region C (F) is given by: 

' (i2i,i?2) e : 
Ri<H{Y\X2,Q)-H{S) 
R2<H{Y\Xi,S,Q) 
y R, + R2<H{Y\Q)-H{S) J 
where the union is taken over all product input distributions Pxi\q Px2\q Pq satisfying E [c^ (X^)] < 
Ffc, A; = 1, 2, and Q is an auxiliary random variable with cardinality bound |Q| < 5. 
Proof: 
• Achievability: 

We provide the proof of achievability for Q = q constant and drop the conditioning on q 
for simplicity. The region (l29l) then follows by using coded time-sharing [f22| . 

Fix PxiPx2PsPy\XuX2,s and E [c^ (X^)] < F^, k = 1,2, for a given Q = q. By setting 
V2 = 5*2 = and Vi = 5*1 = S* in the achievable region (fT6l) we derived, exploiting the 
property (l28l) of deterministic channels, rate pair (i?i,i?2) is achievable where 

Ri < I {Xi, S;Y 1X2) - I {S; S \Xi 



= H {Y 1X2) - H {S) 
R2<I{X2;Y\X^,S) 
= H{Y\X^,S), 



(30) 
(31) 
(32) 
(33) 
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Ri + R2<I (Xi, X2, S;Y)-I {S; S \Xi ) (34) 
= H{Y)-H{S) (35) 

and 

E[cfc(Xfc])]<rfc, A: = 1,2. (36) 

Converse: 

Introduce a random variable Q uniformly distributed over the set {1, 2,..., n}. 
From Proposition 1 and 2 in |fT6l . we have: 

1 " 

i2i<-V/(XH;F,|X2,) + en (37) 
n ^-^ 

i=l 

1 " 

+ i?2 < - V / (Xii, Fi) + e„ (38) 



n 
1=1 



so that 



1 " 

Ri<-Y,I {Xi,; n 1X2,, Q = i) + en (39) 



n 

i=l 



and 



= /(XiQ;FQ|X2Q,g)+e„ (40) 
= J(Xi;F|X2,g) + e„ (41) 
= i7(r|X2,g)-i/(5) + e„ (42) 

1 " 

i?i + < - V / X2,; |g = 2) + e„ (43) 

i=l 

= /(XiQ,X2Q;FQ|g) + e„ (44) 
= /(Xi,X2;F|g) + e„, (45) 
= H{Y\Q)-H{S) + en (46) 

where e,i — )■ as n — )■ 00. 

Moreover, by providing perfect state information to the receiver, one can easily prove the 
following bound: 

1 " 

< - V / (X2,; |Xi„ 50 + en (47) 
n ^-^ 

i=l 
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1 " 

= -Y.^ (^2,; n g = z ) + e„ (48) 

i=l 

= I{X^Q-YQ\X^Q,SQ,Q) + e^ (49) 

= J(X2;F|Xi,5,g) + e„ (50) 

= if(F|Xi,5,Q) + e„ (51) 



where e„ — )■ as n — oo. It can be seen that the distribution on {Q,S,Xi,X2,Y) from 
a given code is of the form Pq,s,x^,X2,y = PqPxi\q Px2\Q PsPy\x^,X2,s ■ Notice that both 
(|42)) and (l46l) exploits property (l28l) and the fact that 5 is independent of Q. 
For the cost constraints, starting from the definition ([3]), we have 

1 " 

Tk>-Y,^[ck{Xk.)] (52) 

i=l 
1 " 

= - Pr (a;fci)cfc (xki) (53) 

n ■^^ — ' 

n ^ 

= V-VPr(xfc|g)cfc(xfc) (54) 
^-^ n ^-^ 

q=l Xk 

= E [ck (Xfc)] . (55) 
Finally, considering the five continuous functions of Pr (xi \q) Pr {x2 \q), 

g^ = H {Y \X2,Q = q) ,92 = H {Y \X^,S,Q = q) ,g:, = H {Y \Q = q) , (56) 
= E [ci (Xi) IQ = g] , = E [C2 (X2) |g = g] , (57) 

one can easily establish the cardinality bound \Q\ < 5 by the Lemma in Appendix C of 
[|22l . This concludes the converse and thus establishes the capacity region. 

■ 

Remark 5: Notice that in the achievability proof, we set Vi = Si = S, which implies that 
Vi is independent of Xi. Hence the achievable scheme proposed in [[T6ll is also optimal for the 
class of channels considered here. □ 

Remark 6: The class Vmac includes the Gaussian model considered in [fT6ll . which is defined 

n 

as Yi = Xii+X2i + Si with input power constraints ^ X] ^ [X^J < Pk and 5*^ ~ A/" (0, erf) strictly 
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causally known to encoder 1. The capacity region C"*"^ is given by: 



Qsnf 



(58) 



This can be identified from Theorem |4] by simple extension to continuous alphabets and evalu- 
ating each bound by the maximum entropy theorem ll23l . Note that when providing both S and 
Xi to the receiver, the channel from user 2 to the receiver is noiseless and hence the individual 
bound on R2 goes to infinity, which is redundant. □ 

Remark 7: The class Vmac contains more channels along with the Gaussian model discussed 
in Remark [6l In particular, consider a class of binary modulo-additive state-dependent MAC 

n n 

channels, e.g., Y = Xi©X2©S', with input cost constraints - ^ E[Xij] < pi and - ^ E[X2i] < 

" i=l " 4=1 

P2 where < pi < |, < j»2 < |- Assume S ~ Bernoulli (ps). Note that assumption (l28l) 
automatically holds for this class of binary deterministic channels. The capacity region is: 

' (i?i,i?2) eR+ : 
^1 < Hh {pi * Ps) - Hb (ps) 
R2 < Hb (P2) 
^ Ri + R2< Hb {pi *P2* Ps) - Hb (ps) 
where pi * p2 denotes the convolution operation of two Bernoulli distributions with parameters 
pi and p2, i.e., pi*p2 = Pi (1 -P2)+P2 (1 -pi), and Hh (p) = -plog2p- (1 - p) log2 (1 -p)- 
The capacity region (l59l) can be identified from Theorem H] as follows. Given a fixed Q = q 

and input constraints E [c^ (X^) \Q = q] < Pkq, for some pkq such that J^Pkq Pr (Q = (?) < Pk, 

Q 

each bound in (|29|) is maximized by the same input distributions Bernoulli (pkq), k = 1,2. For 



bin 



(59) 



example, on bound Ri 



H{Y\X2,Q = q)-H{S) 
= H{X^®S\Q = q)-Hb{ps 

< Hb {piq*Ps) - Hb (Ps) 



(60) 
(61) 
(62) 



where (|62|) follows from the fact that entropy H {Xi ® S \Q = q) monotonically increases with 
the probability of Xi = 1. Hence, 

H{Y\X2,Q)-H{S) (63) 
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hi: 



bin 



(67) 



= J2H{Y\X2,Q = q)-H{S) (64) 
Q 

<Y,Hb{PM*Ps)-H,{p,) (65) 
Q 

< H, {pi * ps) - Hb (ps) (66) 

where (l66l) follows by the concavity of entropy. Thus there is no need for time-sharing among 
different cost constraints. By similar arguments on the other two bounds, we claim the capacity 
region in (l59l) . □ 

Remark 8: It is known from that, without state information, the capacity region for this MAC 
channel is given by [|23l : 

{Ri, R2) G • 
^1 < Hh {pi * Ps) - Hb (ps) 

R2 < Hb {P2 * Ps) - Hb (Ps) 

^ Ri + R2< Hb {pi *P2* Ps) - Hb (ps) 
Hence C^f^ C C^in, which demonstrates the benefit of strictly causal state information in enlarging 
the capacity region. For a numerical example, we set Pi = P2 = | and Ps = j- The corresponding 
regions (|59l ) and (|67] ) are depicted and compared in Fig. [TJ It is evident that the presence of 
strictly causal state information at encoder 1 improves the maximum rate of user 2. □ 

Remark 9: Consider the class of models discussed in the previous example Pi = P2 = P- 
We now discuss an alternative way to achieve the maximum rate of user 2, i.e., R2.max = 
Hb {p * p * Ps) — Hb (ps), for cases where the individual bound on R2 is larger than the sum-rate 
bound. 

The scheme works as follows. We split the rate of user 2 into two parts, i.e., R2 = R2p + R2c- 
The total number of channel uses is divided in blocks in a different fashion for the two users: For 
user 1, each block is of unequal length while for user 2, each block consists of the same number, 
say m, of channel uses. Notice that both users employ potentially all channel uses but the block 
boundaries are different. In the first block of size m (same for both users), user 1 keeps silent 
and user 2 precodes message W2c G [l : 2'"^''^^)] into block of information (tf 2c) that look 
i.i.d according to input distribution Bernoulli (p) and sends it out over m channel uses ll24l . 
At the end of this block, user 1 learns the state sequence and it starts to help user 2 to convey 
the state information to the receiver. Explicitly, for each subsequent block, user 1 losslessly 
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0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 
Fig. 1. Capacity region for an example of modulo-additive state-dependent MAC with input constraints (pi — P2 ~ ^,Ps = j) 

compresses the state sequence in the previous block and sends out the index over current block 
consisting of certain number of channel uses governed by the rule in [|24|. Superimposed with 
transmission of user 1, user 2 sends out a new codeword {^2p,j) with W2pj E [l : 2'"^2pj^ 
which is drawn from a binary codebook randomly and independently generated subject to the 
input constraint for each block j, j > 2. The receiver performs successive decoding, i.e., first 
decodes W2pj for each block treating binary bits sent by user 1 and states as noise, subtracts 
them out, and then decodes W2c- As the total number of channel uses n — oo, it can be shown 
that rate = I (^2p; Y) = Hi,{p* p* ps) — Hb{p* ps) is achievable. After decoding W2pj 
for each block, subtracting them out from the received signal, by analysis in [f24| . W2c can be 
successfully decoded at rate i?2c = Hh(p*ps) — Hi,(ps), thanks to the state refinement from 
user 1. Therefore R2,max = R2c + R2p = Hb {p * p * Ps) — Hb (ps) is achieved for user 2. □ 

V. Generalization to M Users with Independent States 

In this section, we generalize the proposed achievable scheme to an arbitrary number of users 
with independent states. 

Let S denote any subset of [1 : M], i.e., 5 C [1 : M] and S"^ be the complement of S. Any 
random vector X {S) is defined by X {S) = {Xi, X2, ...,Xk) ,k = \S\, i.e., the cardinality of S. 
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Theorem 5: Let V = {Ti, ...,Tm) be given. For the M-user MAC with strictly causal and 
independent state information, a F-achievable rate region is given by 

Rm) '■ ((-Ri, Rm) , (ri, Tm)) e 



nt^, (ri,...,r 



M) 



CO U (7^f^(/3l,...,/3M),(/3l,•••,/3^f)) 

/3i>0,...,/3m>0 



(68) 



where TZf'^ /Sm) is the union of the sets: 



7^fi(/3l,...,/3M) = U^ 



E < min / (X (5) , V (5) ; r |X (5^) , V (5^) ) - E ^ (^fc; '^fe 
rc5 



V r C [1 : M] 



with the union taken over the distributions in the set of 

M 



- { Pvi,...,Vm,Si,...,Sm,Xi,...,Xm,Y ■ Y\_ {PVk\Sk,XkPSkPXk)PY\Si,...,SM,Xi, 



,Xm 



(69) 



(70) 



k=l 



satisfying E [ck (X^)] < f3k, k = 1, M. 
Proof: See Appendix |Al 



VI. M-usER MAC WITH Common State 

In this section, we apply a similar scheme to that for the MAC with independent states and 
derive an achievable rate region for the M-user MAC with common state information. 

Theorem 6: Let T = (Fi, Tm) be given. For the M-user MAC with strictly causal common 
state information, a F-achievable rate region is given by 

(-Ri, Rm) '■ ((-Ri, Rm) , (Xi, Tm)) G 



T^in'' (ri,...,rM) - 

where TZ^"'^ (/3i, /3m) is the union of the sets: 



CO U (7^*['^(/3l,...,/3^f),(/3l,...,/3A/; 

/3i>0,...,/3m>0 



7^,t['^(A,...,/3M) = U< 



{Ri, Rm) e : 
E Rk<n^{[l:M]),V-Y)-I{V-S) 

k£[l:M] 

j:Rk<nX{S)-Y\U,V,X{S'^)), 

k£S 

^ V 5 C [1 : M] 



(71) 



(72) 
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with the union taken over the distributions in the set of 



V. 



M,c 
sc 



V,S,U,Xi,...,Xm,Y '■ PsPv\sPu 



P^ 



y|5,Xi,...,XM 



(73) 



satisfying E [c^ (X^)] < k = l, M. 

Proof: The achievable scheme follows a similar idea of state compression-and-forwarding 
based on noisy network coding while coupled with a cooperative transmission strategy for the 
MAC with common information ll25l . The proof of this theorem is provided in Appendix |B] ■ 
Remark 10: Given any cost pair (/3i,/32), setting M = 2 in 7?.^^'^ derived above, we recover 
the achievable region for the two-user MAC with common state in [[TSl . which is given by the 
union of the sets: 

< J(Xi;r|f/,y,X2) 
R2<I{X2-,Y\U,V,X^) \ (74) 

Ri + R2<I{X^,X2-,Y\U,V) 
y Ri + R2<I{X,,X2,V;Y)-I{V;S) ^ 
where the union is taken over the distributions in the set of 



^•^(/3i,/52) = U 



V. 



2,c 
sc 



{Pv,s,u,Xi,X2,Y '■ PsPv\sPuPxi\uPx2\uPy\s,Xi,X2} 



(75) 



satisfying E [ck (Xk)] < Pk, k = 1, 2. □ 

Remark 11: As shown in ifTSl . (1741) is the capacity for the two-user Gaussian MAC where 
the state comprises the channel noise and it is known strictly causally at both encoders. □ 

VIL Introducing Output Feedback 

In this section, we briefly consider an extension of the model with independent states studied 
in Section |lll] where output feedback is available to some encoders in addition to the strictly 
causal state information. It is well known that the use of output feedback can enable cooperation 
in regular state-independent MACs as ifTSl . Il26l . Il27l . Here we focus on a specific example and 
show that output feedback can enlarge the capacity region for the class of MACs studied in this 
work. 

Let us consider a two-user state-dependent Gaussian MAC shown in Fig. [2l 



Yi — Xii + X2i + Si 



(76) 



18 

n 

with input cost constraints, e.g., power constraints, - X] [^ki\ — ~ -^(O)'^s)' 

" i=l 

k = 1,2 and i = l,...,n. Let us assume that the state information is strictly causally known 
to the first transmitter and a perfect output feedback link is available from the receiver to the 
second transmitter. We have the following encoder and decoder mappings. 




Fig. 2. The state-dependent MAC with strictly causal state information at TXl and output feedback at TX2. 



Definition 3: Let Wk, uniformly distributed over the set Wfc = [1 : 2"^*], be the message sent 
by transmitter k, k = 1,2. A (2"^\ 2"^^, n, Pi, P2) code with strictly causal state information 
at TXl and output feedback to TX2 consists of sequences of encoder mappings: 



/2,^: mxy-'^X2^, t = l,...,n, 
such that power constraints are satisfied and a decoder mapping 

g: 3^" ^ Wi X W2. 
And the average probability of error Pr (E) is defined by: 

P^(^) = 2ni?.2nH. E E P^(^?(y") ^ IK,^2) sent) 

Wl = l W2 = l 

Theorem 7: The capacity region of the model in Fig. |2] is given by: 



c"- U 

0<p<l 



(-Ri, R2) G : 
(i-P^)Pi 



Ri<C 



(77) 
(78) 

(79) 



(80) 



(81) 
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Remark 12: Without feedback, it is known from IfTSl that if the state is known strictly causally 
to both encoders, the capacity is given by: 

_ J {Ri, R2) e M+ : 

^ ~ \ i^l + i^2<g( ^^+^^g^ ) 

whereas if the state is only known strictly causally at encoder 1, the capacity region C'^"^ is 
given by (l58l) . see also lfT6l . 

We plot a instance of these three capacity regions by setting Pi = P2 = 2 and cr^ = 1 in 
Fig. |3l As we observe, C^^^ C C^^ C C^^. By comparison, it can be seen that feedback allows to 
reap some of the benefits that come from cooperation on the transmission of the state sequence, 
since encoder 2 can learn the state information via feedback. □ 

1.6 

1.4 
1.2 
1 

Dr" 0.8 
0.6 
0.4 
0.2 


0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 





Fig. 3. Comparison of different capacity regions 



Remark 13: Consider a forth case where there is no state information at encoder 1 but output 
feedback is available to encoder 2. While the capacity of the case is unknown in general, it 
can be seen that R2 < C j . This is because the capacity of user 2 cannot be improved via 
feedback. Therefore, the capacity region of this case is strictly smaller than the case where the 
state is known at encoder 1, i.e., C*'^. This demonstrates the importance of strictly causal side 
information at encoder 1 in order to fully realize the advantages of output feedback available to 
encoder 2. □ 
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Now we provide the proof of the capacity region C*^. 

Sketch of Proof: 
Achievability: 

The key idea of the achievable scheme is based on the extended Schalkwijk-Kailath feedback 
coding, see IfTSl . [|28l . User 1 divides its power into two parts. It consumes fraction a (0 < a < 1) 
of its power to send its message wi over n channel uses using a codeword drawn from a codebook 
whose entries are generated in an i.i.d. fashion from a zero-mean Gaussian distribution with 
variance aPi. It uses the remaining portion (1 — a)Pi to help user 2 transmit state information 
via feedback coding. 

Specifically, we have the following: 

Codebook Generation: Randomly generate 2"^^ i.i.d. sequences x"p with xipi ~ A/'(0,aPi). 
Index them by x"p (wi) with wi e [l : 2"^^]. Partition [—1 : 1] into 2"^^ small intervals of 
equal length and map messages W2 & [l : 2"^^] to the middle points of these intervals. Index 
them by 6 (102). 

Encoding: 

1) First channel use (i = 1): To send a message pair (^1,^2), user 1 sends out the first 
component xip^i{wi) of Xip{wi) while user 2 sends out 9 {W2). By feedback, user 2 learns 
a noisy state s[ = si + xip^i{wi) after subtracting its own information. User 1 instead 
knows Xip^i{wi) and si as well by our assumption so that it can form the same noisy state 
s[. Hence user 1 and 2 fully cooperate to convey this information to the receiver in next 
channel use. 

2) Other channel uses {i > 2): Let s'-_^ = Sj„i + Xip-i^i (wi); user 1 forms xu = Xip^i (wi) + 
liiSi_i with 7iiS-_i ~ A/'(0, (1 - a) Pi), while user 2 forms X2i = l2is'i_i, with 72iS-_i ~ 
A/'(0,P2); then xu and X2i are sent out over channel use i. 

Decoding: After n channel uses, the receiver first estimates s[ = E,[s[ then, 9 {W2) = 
Hi — s[ = 9 {W2) + (s'l — E [s[ After decoding W2, subtracting it from the received signal, 
wi is decoded. 

Analysis of Probability of Error: We note that using the union bound, we have, Pr (E) < 
Pr (E2) + Pr {El I -£'2)5 where the first term corresponds to the probability of decoding error of 
W2, and the second term is the probability of decoding error of wi given W2 is correctly decoded. 

The probability of decoding error Pr {E2) vanishes as the variance of estimation error of s[ 
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goes arbitrarily small as n — oo. Similar to [[T8l . it can be shown that Pr (E2) — as long as 



^^^^ ;i-a)P.+A + 2V(l-a)f.A (,3, 

aj + aPi 

for sufficiently large n. 

After decoding canceling it, we have Pr {Ei lE!^) — )■ as long as: 

for sufficiently large n. Setting a = (1 — p^), we have: 

^/(i-om+A+MiESMVcf^) (85) 

= C(^±^±|^^) (86) 

Hence any non-negative rate pair R2) that satisfies the following constraints simultaneously 
is achievable: 

ft < c (^1^) . (87) 



ft + ft<c(^±^i±^^^) (88) 

for all < p < 1. 
Converse: 

Providing message W2 to encoder 1, the channel becomes the MAC studied in ||29l where 
encoder 1 knows both wi and W2, encoder 2 knows W2 and output feedback is available at 
the encoders. In fact, the state sequence at encoder 1 in this genie-aided model can be seen as 
equivalent to feedback, since via feedback, encoder 1 effectively obtains s*"^. It is shown in 
l|29l that feedback does not increase capacity and thus the capacity region is given by (ISTI) . ■ 

VIII. Conclusion 

In this work, we have investigated whether states causally known at transmitters in multiple 
access channels is beneficial to achieve larger rates. To answer this question, we first revisited a 
two-user MAC studied in ffT6l and derived a new achievable rate region based on simple encoding 
at the transmitter and a decoding strategy similar to noisy network coding. The new region turns 
out to be potentially larger than that in [|T6l . The capacity result for the Gaussian model of 
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|fT6l for the special case of a single state sequence is generalized to a larger class of channels 
that includes modulo-additive state-dependent MACs. Next, we generalized the proposed scheme 
to an arbitrary number of users case and established the achievable rate region for a class of 
state-dependent Af-user MACs. Based on a similar strategy, we also established the achievable 
rate region for Af-user MACs with common state. In the end, we introduced output feedback 
in the model at hand with independent states and gave an example to show the benefits of 
feedback in enlarging capacity region for a state-dependent channel model. We find that output 
feedback allows state correlation and cooperation besides message cooperation in a multiple- 
user state-dependent channel. As future work, it will be interesting to investigate the impact of 
output feedback for the state-dependent MAC in general. Another direction interesting is to study 
some other multi-user state-dependent channels, e.g., state-dependent interference channels, to 
see whether we can obtain the benefits of causal state information demonstrated in this work or 
not. 

Appendix A 
Proof of Theorem [5] 

Consider b blocks of transmission and assume the same message vector = (wi, ....,wm) is 
transmitted over all the blocks. 

• Codebook Generation: 

M 

Fix Pv,,...,v,,,Sr,...,SM,x„...,XM,Y = U {Pvk\s,„x^ Ps^Px,) Py\Su-,Sm,x,,...,Xm and E [cfc (Xfc)] < 

k=l 

(3k, {k = 1, M) for any given cost tuple (3m)- 

1) For each block j , j E [1 : b], randomly and independently generate 2"^^*^ x 2"^*=" i.i.d. 

n 

sequences according to P (x'^j) = 11 i^kj,i)- Index them as x'^j {wk, tk,j-i), with 

i=l 

Wfc G [1 : 2"^^*] and tkj-i G [l : 2"^^-], k = 1,2, ...,M. 

2) For each block j, j G [1 : 6], for each x^j {wk,tkj-i), randomly and independently 
generate 2"^*" i.i.d. sequences according to the conditional probability mass func- 

n 

tion (pmf) P [v](j \xlj) = H P { ) for each fixed P {vkj \xkj,s'^j). Index 

i=l 

them as t;^. {tkj \wk,tk,j^i), with tkj G [l : 2"^*-], k = 1,2, ...,M. 

• Encoding: 

For j = 1: x]^^ {wk, 1) is sent out by each transmitter, k = 1,2,..., M. 
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For j G [2:6]: At the end of (j — l)th block transmission, the encoders learn the entire 
state sequence s" j_i and S2 j-i respectively. Each encoder looks for an index (compression 
index) tkj^i such that {s'^j_i,vlj_^{tk,j-i\wk,tk,j-2) ,x'^ -_-^{wk,tk,j-2)) G If more 
than one indices are found, choose the smallest one. If there is no such an index, choose 
an arbitrary index at random from [l : 2"^*^"]. Thus the codeword x^^j {wk,tkj-i} is sent 
out at each transmitter for the next block, i.e., the jth block. 
Decoding: 

After b blocks of transmission, the decoder looks for a message vector w = [wi,W2, wm] 
over the Cartesian product [l : 2"^^i] x [l : 2"''^2j _ ^ . 2nbRMj ^^^ch that: 

(x^" (w, t,_i) , (t, I w, t,_i ) , y^) e a: (89) 

for j e[l: b] and some = (tj, t^, t^^) = (tn, tAfi, tMb)- 
Probability of Error Analysis: 

To bound the probability of error Pr (E), without loss of generality, assume w = 1^/ are 
sent out and let Tkj = 1 be the compression index chosen by encoder k for each block j. 
Define the following encoding events: 

M 

^0 = U Eok (90) 

fc=i 

with each 

b 

Eok = [j{{4pVk^ ih, |i,r,,,_i) (i,rfc,,-i)) ^ ^:,V4,} (91) 

Define the following decoding events: 



(92) 



= {(x"(w,t,_i),v"(t,|w,t,_i),2/;) eA^} 
= {Aj (w,tj-,tj_i)} 

for j e[l: b] and some = (tj, t^, t^^) = (tn, tu, tjv/i, ■■■,tMb)- 
Thus by the union bound, 

Pr {E) < 5^ Pr {Eok) + Pr {E^^^ n E',) + Pr ( |J eS\ (93) 

k=l Vwt^Ia/ / 

M 

< 5^ Pr {Eok) + Pr {El^ n E^,) + Pr (E^) (94) 

k=l wt^Im 
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1) By the covering lemma Il22]| . it can be shown that Pr (-Eofc) — as long as 

Rk,>nSk;Vk\Xk) (95) 

for sufficiently large n. 

2) Assume Rkv satisfies the above bound, = 1, M. By the conditional joint typicality 
lemma [|22l, Pr [E^^^ fl Eq) — )■ for sufficiently large n. 

3) Focus on Yl Pr(-E'w): 

To facilitate analysis, for each and j E [2 : b], first define two sets: 

5,(w,Vi) C [1 : M] (96) 

such that Sj (w,tj_i) = {k : {wk 7^ 1) U 7^ 1)}' 

and 

7-(w,Vi)C[l:M] (97) 

such that Tj (w,tj_i) = {k : Wk 1}- 
Hence Tj Sj. 

Also define a third set Sj (w, tj_i) such that Sj (w, tj_i) = [k : tj l,k E Sj (w, tj_i) }. 
Therefore, iSj C Sj and 5^ U Sj = Sj. 
Now we have the fact that: 
x" (Sj (w, tj_i)) , v" (Sj (w, tj-i)) , v" (^Sj (w, tj_i)^ j are always independent of 
X" (w, t,_i)) , V" [S] (w, t,_i)) , 2/;) . 
For any fixed {w,Sj), by the standard argument on joint typicality [l23l Theorem 
15.2.1], we obtain: 

Pr(A,(w,t,,Vi)) (98) 

< 2-«(^(^('S.),v(5,),y(5,);x(5|),y(5|),y)-e) ^^^^ 

^ 2-"(^(^('S.),V(5,);X(5|)y(5j),y|l/(5,) )+/(y(5,);X(5|),y(5|),y)-.) (-jQQ^ 

< 2~n{l{xiS,),V{S,y,x{s^),V{S^),Y\v{Sjyye) (jQj^ 

^ 2-"(^(^('S.)-v-('5.);y|x(5|)y(5,=)y(5,))-.) ^^02) 
^ 2-"(^(^('S.)-v-('5.);y|x(5|)y(5|))-.) 

^Pr(yl7^^^(w,Vi)) (104) 
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for arbitrary e > and sufficiently large n, where (IIOII ) follows from the non-negativity 
of mutual information, (11021) follows from the independence fact mentioned above and 
finally (11031) follows from the definition of Sj (w,tj_i). 

Notice that the upper bound (11031 ) is dependent of tj_i only and independent of tj 

for any fixed w. 

Therefore, 



^ E Enp^(^^ (^'*^'Vi)) 
= E EEnp^(^^(^'t^'Vi)) 

^ E EErip^(^r^(-'Vi)) 

w^ljv/ tt t**-! i=2 

= E EriEp^(^r^(-'Vi)) 

/ 

r(w)C[l;A/] 



6-1 



1:A/] 



2-ri(l(w,S)-e) 



r(w)5^0 



5C[1:A/] tj_i:5j(w,tj_i)=5 
\T(w)C5 



2 fcer 2'=s[i'"l 2' -"^ •'2~^ ^^"^ 

r(w)C[l;A/] 

r(w)/0 



(105) 
(106) 
(107) 
(108) 
(109) 
(110) 
(111) 

(112) 
(113) 



with 



= IiX{S),V (S) ; Y \X (5^) , V {S^ 



(114) 
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and 



I(w,5)-$^i?fcJ (115) 



imin = mm 

SC[1:M] 

where (11071 ) holds by the union bound; (11081) holds due to the independence of codebook 
for each block, the memory less property of the channel and the fact that < Pr (Aj) < 1; 
(fnOl) and (fTT2l) follow from (fT04l) : (fTTTl) follows from the fact that Pr (^Af^^"" (w,tj„i)) 
is dependent of tj_i only given w. 
Finally Yl (-^w) — as long as: 

V r C [1 : M] , 

^ ni?fc, + (6-l)(M-l)<(6-l)r2(I,,i„-e) (116) 

fcer A;e[l:Af] 

i.e., 

V r C [1 : M] , 
fcer 

Setting 6 — )■ oo and n — )■ oo, 

$^i?fc<Imin (118) 

fcer 

= min I J(X(5),V^(5);r|X(5^),V(5^))- Vi?fcJ (119) 

5C 1:M \ / 

res V fce5 / 

< min (l{X{S),V{S);Y\X{S''),V{S''))-Tl{Vk;Sk\Xk)] (120) 
res V fce5 / 

Going over all T C [1 : M], we establish the achievable rate region 7?.^ /3m) given 

in (|69l) . Furthermore, exploiting time-sharing among different cost constraints, we establish 

the F-achievable rate region. 

Appendix B 
Proof of Theorem [6] 

Consider b blocks of transmission and assume the same message vector w = {wi, ....,wm) is 
transmitted over all the blocks. 
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• Codebook Generation: 

Fix Pv,s,u,Xu...,x,„Y = PsPv\sPu [j[Px,\u^ Py\s,Xu...,Xm and E [c^ (X,,)] < /3k, {k = 1, M) 
for any given cost tuple /3a/)- 

1) For each block j , j E [1 : b], randomly and independently generate 2"^" i.i.d. sequences 

n 

Vj- according to the marginal pmf P (f ") = 11-^ ^^^^h fixed P (f " | s" ) . Index 

i=l 

them as vj {tj), with t^- e [l : 2"^''] . 

2) For each block j, j E [1 : b], randomly and independently generate 2"^" i.i.d. sequences 

n 

according to P (n^') = [1 ^ Index them as (tj-i), with G [l : 2"^-]. 

3) For each block j, for each randomly and independently generate 2"''^'' i.i.d. 

n 

sequences x]^j according to the conditional pmf P (a;^^- |n" (tj-i) ) = 11-^ {xkj,i \uj,i (tj_i) ). 
Index them as x^^- (u;fc \tj.i), with u/^ G [l : 2"^^^=], for A; = 1, 2, M. 

• Encoding: 

For J = 1: (tffc |1) is sent out by each transmitter, k = 1, 2, M. 

For j G [2:6]: At the end of (j — l)th block transmission, all encoders learn the entire 

common state sequence Sj_i. They look for an index (compression index) tj^i through 

the common codebook such that (sj'-i^fj'Li {ij-i)) ^ If niore than one indices are 

found, choose the smallest one. If there is no such an index, choose the first index. Thus 

the codeword x^^ {wk \tj-~i) is sent out at each transmitter for the next block, i.e., the jth 

block. 

• Decoding: 

After b blocks of transmission, the decoder looks for a message vector w = [wi,W2, wm] 
over the Cartesian product [l : 2''^^^] x [l : 2""^^^] ... x [l : 2"^-^^"] such that: 

(x'^ (w ) , u] , (t,) , y^) E a: (121) 

for j E [1 : b] and some t** = (ti,t2, ■■■,tb)- 

• Probability of Error Analysis: 

To bound the probability of error Pr (E), without loss of generality, assume w = 1a/ are 
sent out and let Tj = 1 be the common compression index chosen for each block j. 
Define the following encoding events: 

b 
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Define the following decoding events: 

= { (x" (w ) , u] , (tj) , e a:} (123) 

^{A,(w,t,,t,_i)} (124) 

for j e [1 : b] and some = (^1,^2, ■■■,tb). 
Thus by the union bound, 

Fi{E)<Fi{Eo) + PT{El^nE^)+Fil [j E^] (125) 

< Pr (Eo) + Pr {El^ n Eq^) + Pr (^w) (126) 

1) By the covering lemma ll22l . it can be shown that Pr (Eq) — )■ as long as 

R,>I{S;V) (127) 

for sufficiently large n. 

2) Assume satisfies the above bound. By the conditional joint typicality lemma [|22ll . 
Pr {E^^^ n -Eg) — )■ for sufficiently large n. 

3) Focus on ^ Pt (E^): 

To bound this term as tight as possible, we define another event for each block here, 

(tj-i) = {tj-i),yU) e A^} (128) 

Notice that Aj_i C Bj_i, hence: 

J2 P^(^w) (129) 

wt^Ia/ \ t'' i=i / 

= E ( (n^. (w,t„vi)) n (n5,_i(t,_i)) ) (132) 

w^Ia/ tb \ \i=l / \J=2 / / 

^ E E n ^^-1) ^ ^^-1 (^^-1)) ) (133) 
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6-1 

w^^Ia/ t'' i=2 

where (11311) holds by the union bound; (|132b holds by the fact that Aj-i C 
(11331 ) follows from < Pr (Aj) < 1, and (11341) follows from the chain rule and again 
exploits the fact that Aj^i C Bj^i. 

Define sets Sj (w) C [1 : M] such that Sj (w) = {k : Wk ^ 1} for each block j. We 
have the following lemma, which upper bound each probability term in (11341) . 
Lemma 1: For any fixed (w,iSj(w)), we have: 

Pr (A, (w, t„ t,_i) n (t,_i) 15, (t,) ) 

r 1 2-n(/(X([l;M]),y;r)-2.-.') y , _ / ^ 

-| _J_2-n(/(X(5,(w));y|f/,y,X(5;(w)))-£-e')^^^_^ ^ 

^Pr(A7^^'-(w,t,_0) (136) 

where the upper bound is independent of tj, for arbitrary e, e' > and sufficiently 
large n. 
Proof: 

The proof of this lemma follows from the standard argument on joint typicality [|23l 
Theorem 15.2.1] and is deferred to Appendix O ■ 
Thus we have, 

J2 P^(^w) (137) 

6-1 

^ E EnP^(^^ (^'^i'^^-i)^^^-i(Vi)l^.(^.)) (138) 

wt^Im t*- i=2 

^ E EnP^K"(^'Vi)) (139) 

wt^Im t*- i=2 

= E E nEp^K"(^'^^-i)) (140) 

w^lj\f tfcjtfc-i i=2 tj_i 

E Kfc / 1 \ ''"^ 

^ ^ 2 fes5(w) c2^nRv I 2 X 2~"(™™(-'-'"i'-'-'"2)) J (141) 

5(w)C[l:M], \1 ^ / 

5(w)7^0 
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with 

= / (X ([1 : M]) ,V;Y)-R,-2e- e' , (142) 
I„2 = / (X (5 (w)) ■,Y\U,V,X (5^ (w)) ) - e - e' (143) 

for arbitrary e, e' > and sufficiently large ra, where (11391 ) and (11411) follows from 
Lemma [H and (11401 ) holds because Pr (^A^"^^^ (w, tj-i)) is dependent of only for 
any fixed w. 

Setting 6 — > oo and n — oo, XI (-^w) —J- as long as: 

Y,Rk< min (1^1, 1^2) , V 5 C [1 : M] (144) 
fee5 

Note that Imi is not dependent of 5(w) and thus we only need to keep: 

Rk<I{y^{[l:M]),V-Y)-R, (145) 

fce[l:A/] 



<I{y.{[l:M]),V-Y)-I{V-S) (146) 



and 



^ < / (X (5) ; r I ?7, V^, X (5=) ) , (147) 

fce5 

V5 C [1 : M] (148) 

which establishes the achievable rate region T^f^''^ ...,/3m) given in (1721) . Further- 
more, exploiting time-sharing among different cost constraints, we establish the T- 
achievable rate region. 

Appendix C 
Proof of Lemma [H 

For any fixed (w,5j(w)), we have the following enumeration: 

1) V ^ 1: 

x" (w \tj-i) is independent of and also v^^i {tj-i) is independent of y'j^i- Thus: 
a) tj = 1: 

Pr {A, (w, t„ t,_i) n (t,_i) \B, it,) ) (149) 
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< __}_2-'n{IiXill:M]),U;V,Y)-e)2-n(IiV;Y)-e) ^^^^q^ 



1-e 



}_2-n{I{Xill:M]),U;Y\V)+IiV;Y)-2e) ^j^j-j 



1 - e' 



= _J_2'n{I{Xill:M]),U,V;Y)-2e) ^^2) 

1 — e' 

1-e' ^ ^ 

where (11511) holds because V is independent of (X ([1 : M]) , U) while (11531) follows 
from the Markov chain U ^ {X {[1 : M]) ,V) ^ Y . 
b) 7^ 1: 

Pr {Aj (w, t„ n 5,„i (t,) ) (154) 

1-e' 

Therefore, 

Pr (A, (w, t,, t,_i 7^ 1) n 7^ 1) {t,) ) (157) 

< ^^2^"(^(^([^^*^])'^'^)-2^-"') (158) 
~ 1 — e' 

^Pr(A7^-'(w,t,_i^l)) (159) 

where the upper bound is independent of tj, for arbitrary e, e' > and sufficiently large 
n. 

2) tj^i = 1: 

Note the fact that x" {Sj (w) is independent of y^. Thus: 

a) t, = 1: 

Pr {Aj (w, tj, tj^i) n (t,„i) |5, (t,) ) (160) 

(161) 



< 1 o-n(;/(x(.S,);y|^,y,xf.S?))-e) 

-1-e' 



b) 7^ 1: 



Pr (A, (w,t„ Vi) n (t,_i) 15, (t,)) (162) 



1 



2-n(/(x{5,);y|c/,y,x(5|) )+/(v;y|x(5-) )-6) 
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1 - e' ^ ^ 

1 — e' 

where (11651) holds since 

J(V;F|X(5|)) -/(V;F) = iy(y|F)-if (V|X(5j),F) > 0. 
Therefore, 

Pr {A, (w, t„ = 1) n (t,_i = 1) \B, it,) ) (166) 

(167) 

^Pr(A7^^''(w,t,„i = l)) (168) 

where the upper bound is independent of tj for arbitrary e,e' > and sufficiently large n. 
Collecting all results from the enumeration above, we establish the lemma. 



< _J_2^n{i{xis,y,Y\uy,x{s-))^e^e') 
- 1-e' 
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